<<<<<<< HEAD ======= >>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef How Bayesian Priors Affect Predicted Eagle Take at Wind Farms

Background

The U.S. Fish and Wildlife Service (FWS) uses a Bayesian framework to predict eagle collisions and fatalities at proposed wind energy sites. One of the appeals of Bayesian modeling is the ability to incorporate prior information and uncertainty regarding parameter values into estimates, ideally providing a better representation of reality. However, there is still debate over the appropriate choice and use of priors. Additionally, as Bayesian methodology is still gaining widespread familiarity, the ways in which prior information affects model outcomes can be unclear.

The purpose of this paper is to illustrate and quantify the effects of Bayesian priors used by FWS on the predicted outcomes of the eagle mortality model, in terms of predicted eagle fatalities. We use both empirical data from wind farms provided in a study presenting the methods used by FWS to implement the Bayesian prediction model (Bay et al. 2016), as well as a simulated dataset to demonstrate these effects both in theory and in practice.

Fully understanding the effects of priors is relevant to both energy developers and wildlife conservationists, because estimates of eagle take are used to determine permitting and mitigation costs.

Bayesian Model

The components of the Bayesian model used by FWS to predict eagle fatality are:

  • Eagle Exposure Rate: minutes per hour that eagles are estimated to fly within risk areas around turbines where collisions may occur.
  • Collision Rate: probability an eagle flying within a turbine risk area is struck and killed.
  • ExpansionFactor: constant converting fatality rate to annual fatalities, based on the size of a wind facility.

These parameters are represented as probability distributions, and combined to produce a posterior distribution representing the relative likelihood of how many ealges will be killed annually at a site.

\(Fatalities = Exposure * Collision Rate * Expansion\)

<<<<<<< HEAD
=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

Figure 1. Demonstration output of Bayesian eagle fatality model.

To be conservative, FWS uses the 80th percentile of this posterior distribution in setting take limits, and we display these values unless otherwise noted.

In this paper, we focus on eagle exposure rate, as this parameter is subject to both site-specific survey data, and a Bayesian prior distribution. The expansion factor is determined completely by the characteristics of a site. Similarly, the collision rate is estimated entirely by a prior distribution based on rates observed at existing wind facilities.

Survey Data

The data shown in Table 1 are the survey data used as example values for the eagle wind mortality model. These were derived from an empirical dataset (Appendix A in Bay et al. (2016)), which contains pre-construction eagle survey data from 26 wind energy sites. Survey effort indicates the amount of time and area covered during pre-construction surveys, and eagle observations are the duration of time over which eagles were observed flying within survey areas.

Table 1. Data from Bay et al. (2016). Effort was calculated as the product of survey area (km 2 ) and observation time (hr), derived from Appendix A.

<<<<<<< HEAD
=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef


Combining Prior Information with Survey Data

Eagle flight time and survey effort are used to estimate eagle exposure at a given wind site. Figure 2 shows how the exposure measured at a site (green lines) is integrated with a consistent prior distribution of exposure probabilities (purple), to produce a probability distribution of exposure (yellow). This resulting distribution is used as the exposure rate to predict eagle fatalities at a site. The effect of the prior distribution is to modify extreme observed values that differ greatly from mean observations across a larger sample of sites.

<<<<<<< HEAD
=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

Figure 2. Eagle exposure rates used to estimate predicted fatalities across from a sample of wind farms. A consistent prior probability distribution (purple) is combined with site-specific survey data (green line) to estimate a distribution of exposure probabilities for a proposed project (yellow).

Using Priors Changes Estimates

At the majority of sites, the Bayesian estimate of eagle fatalities using prior exposure information was higher than a site-specific estimate (Fig. 2). These instances are indicated by points above the 1:1 line in Figure 2. The influence of eagle exposure priors on estimated eagle fatalities was less at facilities that expended greater effort on pre-construction surveys. This pattern is indicated by the sites with the most intensive surveys falling closest to a 1:1 relationship between the two estimates (Fig. 2).

<<<<<<< HEAD
=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

Figure 3. Relationships between estimated eagle fatalities produced incorporating prior information about eagle exposure at wind facilities, and those produced using only site-specific survey data. Plotted values are the 80th percentiles of the posterior distributions. Marker size is proportional to the size of the 80% CI produced from the posterior distribution using exposure priors.

Source: Bay et al. (2016)

Simulation Study

To investigate the effects of prior information and survey effort on predicted take, we created a dataset representing a range of possible pre-construction survey values for survey area, time, and eagle observations. We then produced eagle fatality estimates for these possible values using the FWS bayesian model.

Prediction Discrepancies Increase at Extreme Values

For eagle exposure rates < 1.21 min/hr*km3, predicted fatlity rates estimated using priors were higher than site-specific estimates (i.e. above the 1:1 line in Fig. 3). This pattern reversed when exposure was > 1.21. The threshold of 1.1 corresponds to the mean exposure rate of the prior exposure distribution. The actual value of this threshold is subject to change as prior distributions are updated with new survey data, but the pattern will be consistent. Figure 4 also illustrates that predicted fatalities estimated with and without priors are more similar when survey effort is greater (i.e., larger circles).

<<<<<<< HEAD
=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

<<<<<<< HEAD Figure 4. Relationships between Bayesian and site-specific estimates of eagle fatalities, generated from simulated data. Plotted values are the 80th percentile of posterior fatality distributions, multiplied by the mean site scaling factor from Bay et al. 2016. ======= Figure 4. The mean of the exposure prior determines whether Bayesian estimates under or over predict fatalities relative to survey data Plotted values are the 80th percentile of posterior fatality distributions estimated with and without exposure priors, multiplied by the mean site scaling factor from Bay et al. 2016. Circle size is proportional to survey effort >>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

<<<<<<< HEAD

The prior distribution of exposure probabilities has a greater effect on estimates when observed exposure rate at a site is farther from the mean of the prior. Figure 5 illustrates this relationship in terms of how many standard deviations an observed exposure rate was from the prior mean . At one standard deviation above or below the mean, take estimates can differ by as much as 0.4 eagles per year.

=======

The prior distribution of exposure probabilities has a greater effect on estimates when observed exposure rate at a site is farther from the mean of the prior. Figure 5 illustrates this relationship in terms of how many standard deviations an observed exposure rate was from the prior mean. At one standard deviation above or below the mean, take estimates can differ by as much as 1.5 eagles per year.

>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

<<<<<<< HEAD Figure 5. Discrepancy between Bayesian and site-specific estimates of eagle fatalities, based on the difference between the observed exposure rate and prior mean (z-score). Discrepancies were calculated between the 80th percentile of the posterior fatality distributions. Values were generated from simulated data, and multiplied by the mean site scaling factor from Bay et al. 2016. ======= Figure 5. Discrepancy between Bayesian and site-specific estimates of eagle fatalities are greater at extreme values. Differences between the observed exposure rate and prior mean were standardized to z-scores. Discrepancies were calculated between the 80th percentile of the posterior fatality distributions. Values were generated from simulated data, and multiplied by the mean site scaling factor from Bay et al. 2016. >>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

Greater Survey Effort Reduces Effect of Priors

Increased survey effort reduced the influence of prior distributions of exposure probability on estimates of fatalities, resulting in less discrepacy between predictions obtained using priors and those obtained using site survey data only. The magnitude of the effect of effort on estimate discrepancy was contingent upon the observed eagle exposure rate. This discrepancy decreased logarithmically with greater survey effort (Fig.6). The precision of posterior estimates, measured by the difference between the 80th percentile and mean estimate standardized by the mean, did not change as survey efforts increased.

<<<<<<< HEAD
=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

Figure 6. Estimates of eagle fatality become more similar as estimated with and without priors with greater survey effort. Changes in a) difference between fatalities predicted with and without prior exposure information, and b) the size of the 80% CI of the predicted fatality distribution, relative to the mean, as a funciton of survey effort.

Harder to Prove Big Projects Take No Eagles

A bayesian approach to estimating fatalities guards against situations where surveys fail to detect eagles that are in fact present around the site. However, this means that sites where there are truly no birds will have some predicted take. In situations where no eagles are observed during pre-construciton surveys, greater survey effort moves predicted take closer to zero (Fig. 7). This response is highly dependent on the size of the wind farm. For the smallest sites, 48 hr.km2 of survey effort reduced fatality predictions < 1. Large sites with no eagle observations require more survey effort to reduce predicted take to zero (~300 hr.km2). however the rate of reduction is greater at large sites than at smaller sites.

Figure 7. Project size affects predicted take at sites where no eagles are observed. Changes in predicted fatalities, given no observed eagles, as a function of survey effort for different sized wind farms.

<<<<<<< HEAD

Conclusions

Primary Takeaways

The use of prior distributions of exposure probability can strongly affect the predicted eagle take at wind energy facilities. Among 26 wind farms, the average difference between 80th percentiles of Bayesian versus site-specific estimates of eagle fatalities was 3.91 eagles per year (max = 15.39, s.d. = 4.65). Across our range of simulated values for eagle activity rates and survey effort, site-specific and Bayesian fatality estimates differed by as much as 11.12 eagles per year.

The discrepancy between site-specific and Bayesian estimates of predicted fatality increases as the eagle activity observed at a site is more extreme. Controlling for the effect of survey effort, the discrepancy between estimates increases by 0.31 eagles for every standard deviation an observed level of eagle activity falls from the prior mean.

Survey effort can also change the magnitude of this effect. At the minimum survey effort required by FWS, the difference between site-specific and Bayesian fatality estimates ranged from -11.12 to 6.28 eagles per year. For a site with minimum eagle exposure rates (0.01 min/hr*km3), adding an additional survey hour per month, or survey plot, to the FWS minimum will decrease discrepancy by 0.48 predicted annual fatalities, on average.

Recommendations

The relationship between deviation of observed exposure rates from the mean of the prior and estimate discrepancy suggest thresholds for trigger points at which FWS may want to consider a different approach to permitting and mitigation, or require additional survey effort. These thresholds should be based on the standardized distance initially observed eagle activity rates at a site are from the mean of the prior distribution.

Increasing minimum survey effort - either number of plots, or number of hours - will reduce the influence of general priors, increasing confidence in the posterior estimates. This will benefit both wind developers, who can be less skeptical that mitigation requirements are being artificially inflated, and FWS, which can be more confident that Bayesian priors are not underpredicting fatality rates at sites with high eagle activity.

The addition of covariates to the prior distribution of exposure probabilities may also help alleviate the effect of priors on predictions at sites with extreme observed eagle activity. Rather than integrating site specific values with prior information from all wind projects, they could be integrated with distributions from sites sharing similar characteristics.

Discussion

An advantage of Bayesian modeling is to moderate the effect of random outlier observations. However, this also means that where surveys accurately detect extreme observed eagle activity, this site-specific information can be effectively washed out. Thus, an important question is when do extreme eagle activity levels observed during surveys accurately reflect site-specific eagle exposure, rather than representing random anomallies. At two of four sites where Bayesian estimates were lower than site-specific estimates, the observed fataility estimate fell outside of the bayesian 80th percentile using priors, but was covered by the 80th percentile of the site-specific estimate. These sites reported 2 of the top 3 eagle exposure rates, and illustrate instances where site-specific information should carry more weight. In both cases, survey effort was in the lower 25% of all sites. Permits are re-evaluated every 5 years. Thus, the mean observed discrepancy of 4 fatalities per year could equate to the unexpected take of an additional 22 eagles before models are updated. FWS advises that if permitted eagle take exceeds 1% of the estimated population size of either species within the LAP area, additional take is a concern. If take exceeds 5% of the estimated population size within the LAP area, additional take is considered inadvisable. Cummulative authorized take must not exceed 5% of local populations. Underestimating eagle take by a few individuals per year over a 5 year permit, could approach these thresholds. The inverse is not ideal from the perspective of regulated entities, as wind developers could pay mitigation costs for an unwarranted 5 additional incidental takes.

It should be noted that our Bayesian fatality estimates were generated by integrating observed measures of eagle flight time and survey effort with prior defined by the mean of these values across survey sites. Survey effort is used to adjust the rate parameter of the gamma distribution used to define the eagle exposure prior. Thus, increased survey effort leads to a narrower posterior distribution, and smaller 80% CI. In practice, this makes sense, as we have greater confidence that the results of more intensive surveys are reflective of consistent patterns, rather than conditions during a limited number of instances.

=======
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

Methods

We generated posterior distributions for predicted eagle fatalities from empirical and simulation data using the FWS Bayesian model. The prior distribution on eagle exposure used by FWS is a gamma distribution Exposure ~ Gamma (11.82, 9.76), with shape and rate parameters represented by eagle flight time in minutes, and survey effort in (hr km3). The prior distribution on collisions used by FWS is a beta distribution Collision ~ Beta (1.64, 290.02) with parameters representative of number of collisions and minutes of exposure.

Empirical Data

<<<<<<< HEAD

The prior distribution on eagle exposure used by FWS is a gamma distribution, with shape and rate parameters represented by eagle flight time in minutes, and survey effort in (hr km3). Appendix A in Bay et al. (2016) initially presented survey effort data as plot area in hectares, and observation time in minutes. We converted these to Survey effort (hr km3) by multiplying plot area (ha) by 0.2 km/ha, times the reported observation time converted to hours.

We derived an expansion factor, which was not reported in Bay et al. (2016), to produce fatality estimates in numbers of eagles per year. We quantified the relationship between ‘Flight Risk Area’, and predicted ‘Collisions per Annuum’ reported in Bay et al. (2016) using a generalized linear model. First, we divided Collisions per Annuum reported at each site by the site exposure, and mean of the prior collision rate distribution to obtain the site-specific collision rates per eagle activity rate. We modeled this rate as a quadratic funciton of site risk area.

glm(data = Bay_16, (COLLISIONS/(FLIGHT_MIN/EFFORT))/0.002895415 ~ RISK_HA + RISK_HA^2)
=======

We used wind energy site survey data from Appendix A in Bay et al. (2016) to obtain example survey values. This data recorded plot area in hectares, and observation time in minutes. We converted these to survey effort consistent with the units used by FWS (hr km3) by rescaling plot area (ha) by 0.2 km/ha, and multiplying by the reported observation time in hours.

We also estimated the expansion factor (not reported in Bay et al. (2016)) for each site to produce fatality estimates in numbers of eagles per year. We quantified the relationship between ‘Flight Risk Area’, and predicted ‘Collisions per Annuum’ reported in Bay et al. (2016) using a generalized linear model. First, we divided Collisions per Annuum reported at each site by the site exposure, and mean of the prior collision rate distribution to obtain the site-specific collision rates per eagle activity rate. We modeled this rate as a quadratic funciton of site risk area.

glm(data = Bay_16, (COLLISIONS/(FLIGHT_MIN/EFFORT))/0.002895415 ~ RISK_HA + RISK_HA^2)
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef

The coefficients from this model were then used as the expansion factor for each site. Thus we calculate the flight risk area as -2.255810^{4} + 2306 * Risk Area + -2.61 * (Risk Area) 2.

Simulation Data

We generated hypothetical values for survey effort based on the minimum requirements provided by U.S. FWS for pre-construction monitoring. FWS requires at least one cylindrical survey plot with radius > 800m and height > 200m, and that plots be surveyed for at least 12 hours per year, for two years. Thus, the minimum values for survey area and time were 0.402 km3, and 24 hrs, respectively. We simulated up to five plots (area = 2.01) in increments of one, and up to 240 hrs in increments of 12, and calculated survey effort for all combinations.

Observed eagle flight time (min) provided in Bay et al. (2016) is a function of survey effort and eagle activity at a site. Therefore, to generate a range of potential exposure values (min/hr*km3), we divided flight time by effort at each survey site to obtain an effort invariant measure of eagle exposure rate, and took a random sample of 20 values. The final simulation data included all combinations of survey effort and eagle exposure rate, which we multiplied to obtain flight minutes, producing 1000 values.

flight <- sample(BAY16$MIN/Bay16$EFFORT, 20)
survey_time <- seq(1, 10, 1)*12*2
survey_area <- seq(0.402, 2.01, 0.402)
df <- expand.grid(TIME = time, AREA = area, eagle_rate = flight)
df$beta <- df$TIME*df$AREA
df$alpha <- df$eagle_rate*df$beta

Model Output

<<<<<<< HEAD

We generated posterior distributions for predicted eagle fatalities using the FWS Bayesian model. We use the updated FWS priors for collision rate, Collision ~ Beta(9.38, 3224.51), and a prior exposure probability distribution defined by the survey data, Exposure ~ Gamma(a, b), where a represents observed eagle flight minutes, and b represents survey effort, following Bay et al. (2016). We defined a and b for the exposure prior as the mean values from survey data. To obtain posterior estimates, we drew 100,000 random samples from each of these distributions and collected their products to form the posterior distribution of eagle fatality rates.

prediction <- function(iters, alpha, beta){
 out <- data.frame(collision = rep(NA,iters),
                   exposure = rep(NA, iters),
                   fatality = rep(NA, iters)
                   )
 for(n in 1:iters){
   c <- rbeta(1, shape1 = 9.38, shape2 = 3224.51)
   e <- rgamma(1, shape = alpha, rate = beta)
   f <- c*e
   out[n,] <- c(c,e,f)
 }
 return(out)
}

We generated predictions at each survey site using only data from that site, in which case a and b were set to the observed values, and by updating the prior exposure distribution with observed values following Bay et al. (2016). We applied this same procedure to all 1000 combinations of simulated survey effort and eagle exposure, multiplying the predicted fatality rates by the mean expansion factor derived from survey data to obtain distributions of predicted annual fatalities.

#Predictions integrating Exposure prior and survey data
prediction(100000, a + mean(Bay16$FLIGHT_MIN), b + mean(Bay16$EFFORT))

#Predictions from survey data
prediction(100000, a, b)
=======

We generated eagle fatality predictions using only simulated or empirical values for observed eagle minutes and survey effort, and by updating the prior probability of exposure using these values. We obtained the posterior exposure distribution as in New et al. (2013) Exposure ~ Gamma(a + minutes, b + effort), where a and b are the shape and rate parameters from the prior distribution.

# simFatal funciton from New et al. (2013) modified to perform multiple iterations.
  # BMin:     observed number of bird minutes
  # Fatal:    annual avian fatalities on an operational wind facility
  # SmpHrKm:  total time and area surveyed for bird minutes
  # ExpFac:   expansion factor
  # aPriExp:  alpha parameter for the prior on lambda
  # bPriExp:  beta parameter for the prior on lambda
  # aPriCPr:  alpha parameter for the prior on C
  # bPriCPr:  beta parameter for the prior on C
  # iters: number of sampling iterations

simFatal <- function(BMin=-1, Fatal=-1, SmpHrKm, ExpFac, aPriExp=1,
                     bPriExp=1,aPriCPr=1, bPriCPr=1, iters){
  out <- data.frame(collision = rep(NA,iters),
                    expose = rep(NA, iters),
                    fatality = rep(NA, iters)
  )

  # The default of a negative value for BMin or Fatal indicates no data
  if(BMin>=0){
    aPostExp <- aPriExp + BMin
    bPostExp <- bPriExp + SmpHrKm
  }else{
    aPostExp <- aPriExp
    bPostExp <- bPriExp}
  # Update the collisions prior
  if(Fatal>=0){
    aPostCPr <- aPriCPr + Fatal
    bPostCPr <- ((rvmean(Exp) * ExpFac) - Fatal) + bPriCPr
  }else{
    aPostCPr <- aPriCPr
    bPostCPr <- bPriCPr}

  for(i in 1:iters){
    Exp <- rgamma(n=1, aPostExp, bPostExp)
    CPr <- rbeta(n=1, aPostCPr, bPostCPr)
    Fatalities <- ExpFac * Exp * CPr
    out[i,] <- c(CPr, Exp, Fatalities)
  }
  return(out)
}


estimates <- function(iters, a, b){
  out <- simFatal(BMin = a,
                  Fatal = -1,
                  SmpHrKm = b,
                  ExpFac = mean(Bay16$SCALE),
                  aPriExp = 11.81641,
                  bPriExp = 9.7656250,
                  aPriCPr = 1.638029,
                  bPriCPr = 290.0193,
                  iters = iters)
  fatality <- mean(out$fatality)
  q80 <- quantile(out$fatality, c(0.8))
  out2 <- simFatal(BMin = a,
                   Fatal = -1,
                   SmpHrKm = b,
                   ExpFac = mean(Bay16$SCALE),
                   aPriExp = 0,
                   bPriExp = 0,
                   aPriCPr = 1.638029,
                   bPriCPr = 290.0193,
                   iters = iters)
  fatality2 <- mean(out2$fatality)
  q82 <- quantile(out2$fatality, 0.8)
  return (c("MN_F" = fatality, "U_F" = q80, "MN" = fatality2, "U" = q82))
}

We used an iterative sampling procedure to produce a posterior distribution of annual eagle fatalities. During each iteration, we drew a random sample from the updated exposure distribution and the prior collision probability distribution, and multiplied these values together with a site specific expansion factor. For simulated values, we used the mean expansion factor from the empirical data. For each site and set of simulated values, we performed 100000 iterations of this sampling procedure, and calculated the mean and 80th percentile values of the resulting distribution.

#Predictions integrating Exposure prior and survey data
sim <- plyr::mdply(df[, c(5, 4)], estimates, niters = 100000)

#Predictions from survey data
emp <- vapply(1:nrow(Bay16), function(x){
  estimates(100000, Bay16$FLIGHT_MIN[x], BAY16$EFFORT[x])
  return (c(fatality, q80, fatality2, q82))
  },
  USE.NAMES = FALSE, FUN.VALUE = c(0,0,0,0))
)
>>>>>>> b0985afcffcde2cbf39fc1ecc76a55d786dd78ef